1 INTRODUCTION.

It is a safe bet that everyone reading this is familiar with the quadratic 
formula. 

Fact 1 If $ax^2 + bx + c = 0$ and $a \neq 0$, then one of the following holds:

$$x = \frac{-b + \sqrt{b^2 - 4ac}}{2a}, \qquad x = \frac{-b - \sqrt{b^2 - 4ac}}{2a}.$$

One might say that this formula allows one to solve the quadratic with a 
pencil. There is an analogous formula for the general quartic equation, 

$$ax^4 + bx^3 + cx^2 + dx + e = 0.$$



By this, we really mean four different formulas each of which gives one root 
of the equation. Each formula is expressible using only the operations of 
addition, subtraction, multiplication, division, and extraction of roots. If
you just want to know the formula or one way to derive it, and don't care
about anything else, you can just skip to the last section of this paper. The
formulas for the roots of a general quartic are listed and derived there. The 
derivation requires the solution of the general cubic (for which we give only 
hints at the derivation). Readers should be forewarned: this paper is a bit 
like a tourist trap. There is a main attraction, but you don't get to see it until 
you pass many of the souvenirs that are available along the way. The real 
goal of the paper is to expose readers to a number of mathematical tidbits 
related to the solution of the general quartic. 
Figure 1: Curves in pencil (1).

There is a mathematical notion of a "pencil" that is rather cool and has 
gained prominence in geometry and topology in recent years. Pencils were 
also studied extensively by algebraic geometers in the nineteenth century. 
The geometric picture behind the solution to the quartic presented in the 
last section is a particular pencil associated with the quartic. We begin with 
a description of a pencil from the viewpoint of a topologist.

2 PENCILS. 

The family of quadratic sections given by 

$$y^2 - 7y + 6x + \lambda(y - x^2) = 0 \qquad (1)$$

for the different values of the parameter $\lambda$ is an example of a pencil of
curves. Several members of this family (corresponding to $\lambda = -5$,
$\lambda = -2$, $\lambda = 0$, $\lambda = 1$, $\lambda = 2$, $\lambda = 5$, and
$\lambda = \infty$) are graphed in Figure 1, which is the
key to the solution of the general quartic given in this paper. By looking at
it carefully you may even be able to figure out the general formula on your
own. If you are asking whether $\lambda$ is complex or lies in some other field,
you are not an algebraic geometer. It is standard practice in algebraic geometry
to consider equations as "functors" that associate a geometric object to each
different field or "set of numbers." The figures displaying curves are the
pictures for the real case, and the later figures displaying surfaces depict the
complex case.

Figure 2: The pencil of cubics.
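As a quick sanity check on equation (1), the following Python sketch evaluates its left-hand side at the four points $(x, x^2)$ with $x$ equal to 0, 1, 2, and -3, the roots that the last section finds for the associated quartic. The name `pencil_value` is ours and purely illustrative. The sketch only confirms that each sampled member of the pencil passes through these four points.

```python
def pencil_value(x, y, lam):
    """Left-hand side of equation (1) for the parameter value lam."""
    return y**2 - 7*y + 6*x + lam * (y - x**2)

# Candidate base points: (x, x^2) for the roots of x^4 - 7x^2 + 6x = 0
# quoted in the last section of the paper.
base_points = [(x, x**2) for x in (0, 1, 2, -3)]

# Every finite member of the pencil drawn in Figure 1 vanishes at all four points.
for lam in (-5, -2, 0, 1, 2, 5):
    assert all(pencil_value(x, y, lam) == 0 for x, y in base_points)
```

The member with $\lambda = \infty$ is the parabola $y = x^2$ itself, which contains the four points by construction.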

The family of curves obtained as the zeros of the family of polynomials
of degree three in two variables given by

$$y^2 - 2x^2(x + 8) + \lambda\left(x^2 - (y + 1)^2(y + 9)\right) = 0 \qquad (2)$$

for varying $\lambda$ is a very standard example of a pencil of curves. Several of
these curves (corresponding to $\lambda = -1$, $\lambda = 0$, $\lambda = 2$,
$\lambda = 10$, and $\lambda = \infty$)
are graphed in the left frame of Figure 2. Two of the generic curves (the
curves with $\lambda = 2$ and $\lambda = -1$) are singled out for special attention
in the magnified right frame of Figure 2. If you look carefully you will see that
these curves have exactly nine points in common (see the left frame of Figure 3
for a further magnification of the crossings near the origin). Indeed, every
curve in the pencil passes through each one of these nine points, which are
called the base points of the pencil. For any other point in the plane there is
a unique member of the pencil that passes through it. One of the singular
curves ($\lambda = 0$) is displayed in the right frame of Figure 3. The singular
curve is called a fish tail by topologists and a nodal cubic curve by algebraic
geometers. (A formal definition of generic and singular curves is given later in
this section.) Algebraic geometers, who tend to be rather smart, draw pictures
of pencils similar to Figure 2, but realize that much more is happening than
the picture shows directly.

Figure 3: Curves from pencil (2).
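The claim that a unique member of the pencil passes through each point that is not a base point can be made concrete. Writing pencil (2) as $p + \lambda q = 0$, the member through $(x, y)$ has $\lambda = -p(x, y)/q(x, y)$. The following Python sketch does this with exact rational arithmetic. The function names are ours, and returning `None` to stand for $\lambda = \infty$ is just a convention of this sketch.

```python
from fractions import Fraction

def p(x, y):
    """Generator of the pencil at lambda = 0 (the nodal cubic)."""
    return y**2 - 2 * x**2 * (x + 8)

def q(x, y):
    """Generator of the pencil at lambda = infinity."""
    return x**2 - (y + 1)**2 * (y + 9)

def member_through(x, y):
    """Return lambda with p + lambda*q = 0 at (x, y); None stands for infinity."""
    x, y = Fraction(x), Fraction(y)
    pv, qv = p(x, y), q(x, y)
    if pv == 0 and qv == 0:
        raise ValueError("base point: every member of the pencil passes through it")
    if qv == 0:
        return None
    return -pv / qv

print(member_through(1, 1))   # -17/39
```

For instance, the origin lies on the nodal cubic, so `member_through(0, 0)` returns 0.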

We will now look at the geometry of pencils in greater detail, because the
geometry is absolutely beautiful and really important. The first point that
arises is that it is appropriate to use algebraically closed fields of numbers
such as the field $\mathbb{C}$ of complex numbers. The reason for this is that any
polynomial $p(x) = a_0 + \cdots + a_d x^d$ of positive degree $d$ with complex
coefficients has exactly $d$ zeros in $\mathbb{C}$, provided that the zeros are
counted with multiplicity. The same statement cannot be made about polynomials
over the real numbers. To understand a polynomial fully one must realize that its
graph is what normal people would call a surface. (What an algebraic geometer
calls a surface is actually four dimensional.) For example, the picture that an
algebraic geometer would draw of the curve with equation $y^2 = x^3 - x$ is
displayed in Figure 4. We can describe the actual shape of this curve by starting
with the observation that, with the exception of the points $x = -1$, $x = 0$, and
$x = 1$, every $x$-value corresponds to two $y$-values. This means that we can
construct the curve from two copies of the complex plane. The left side of
Figure 5 displays the points $x = -1$, $x = 0$, and $x = 1$ in the complex plane,
together with a segment connecting the points $x = -1$ and $x = 0$ and a ray from
$x = 1$ to infinity. The segment and ray are called branch cuts. To go further
we use Euler's formula for the complex exponential:
$e^{i\theta} = \cos\theta + i\sin\theta$.
Consider the circle described by $x = e^{i(t - \pi)}/4$ for $t$ in $[0, 2\pi]$ and
think about the corresponding $y$-values. We have
$y = \pm\sqrt{x}\sqrt{x^2 - 1}$. For $t = 0$ we pick
$-i/2 = e^{-i\pi/2}/2$ as the top sheet (principal branch) of $\sqrt{x}$. The
values of $\sqrt{x}$ are given by $e^{i(t - \pi)/2}/2$ as $x$ traverses the circle.
Notice that, after traversing the circle (i.e., after $t$ goes from $0$ to
$2\pi$), $\sqrt{x}$ changes to $i/2$, which is on the other sheet (branch). (This
is the first instance that we will encounter of what is called monodromy.) The
value of $y$ also switches from one sheet to another as $x$ traverses the circle.
We would like to match the $y$-values up consistently with the $x$-values. This is
why we introduce the branch cuts. The right side of Figure 5 displays the result
of cutting open two copies of $\mathbb{C}$ along the branch cuts and glueing them
together in a way consistent with the equation $y^2 = x^3 - x$. The top sheet
(principal branch) is left alone, whereas the bottom sheet (other branch) is
reflected in the real axis before being glued to the principal branch. The
resulting object is known as the Riemann surface for the function
$y = \sqrt{x^3 - x}$. Talk about this with the mathematically adept
or read all about it under the heading "Riemann surfaces" in your library.

Figure 4: The curve with equation $y^2 = x^3 - x$.
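The sheet switch described above can be watched numerically. The following Python sketch follows $\sqrt{x}$ continuously around the circle $x = e^{i(t - \pi)}/4$, starting from the value $-i/2$ chosen in the text. Picking, at each step, whichever square root is nearer to the previous value is our own simple device for continuity and is not a method taken from the paper.

```python
import cmath
import math

def continue_sqrt(steps=1000):
    """Follow sqrt(x) continuously along x = exp(i(t - pi))/4, t from 0 to 2*pi."""
    w = -0.5j  # starting value from the text: a square root of x = -1/4
    for k in range(1, steps + 1):
        t = 2 * math.pi * k / steps
        x = cmath.exp(1j * (t - math.pi)) / 4
        r = cmath.sqrt(x)
        # keep whichever of the two square roots is closer to the previous value
        w = r if abs(r - w) <= abs(-r - w) else -r
    return w

end_value = continue_sqrt()
print(abs(end_value - 0.5j) < 1e-9)   # True: we arrive at i/2, on the other sheet
```

The starting and ending points of the loop coincide in the $x$-plane, yet the square root has changed sign, which is the monodromy.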

A second issue that comes up is that it is useful to add points at infinity.
The reason for extra points at infinity is to ensure that the zero locus of a
polynomial of degree $d$ in two variables always intersects the zero locus of a
polynomial of degree $e$ in two variables in $de$ points. Without extra points
most pairs of lines in a plane intersect in one point. Properly adding extra
points guarantees that every pair of distinct lines intersect in exactly one
point. This is the motivation for the definition of complex projective space.

Figure 5: The Riemann surface of $y = \sqrt{x^3 - x}$.

The $n$-dimensional complex projective space is the collection of equivalence
classes of nonzero ordered $(n+1)$-tuples $(z_0, \ldots, z_n)$, where two
$(n+1)$-tuples are equivalent if they are complex scalar multiples of each other.
(This is very similar to the definition of a rational number: the fractions 2/3
and 4/6 are the same.) The $n$-dimensional complex projective space is denoted by
$\mathbb{CP}^n$, and its elements are denoted by $[z_0 : z_1 : \ldots : z_n]$. The
set of points of the form $[1 : z_1 : \ldots : z_n]$ is a copy of $n$-dimensional
complex space $\mathbb{C}^n$ contained in $\mathbb{CP}^n$. The points at infinity
are the points of the form $[0 : z_1 : \ldots : z_n]$. In
the case of $\mathbb{CP}^1$, it is natural to think of $[w : z]$ as the fraction
$z/w$, so that $[0 : 1]$ represents infinity. The collection of complex numbers,
together with this one extra point, is known as the Riemann sphere (or extended
complex plane). It is not too difficult to extend this to a method to add points
at infinity to many spaces.
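The equivalence relation behind homogeneous coordinates is easy to test by machine. Two nonzero tuples are scalar multiples of each other exactly when all of their $2 \times 2$ minors vanish. The following Python sketch uses that criterion. The function name and the tolerance are our own choices.

```python
def same_point(u, v, tol=1e-12):
    """True if the nonzero tuples u and v represent the same point of CP^n."""
    if len(u) != len(v):
        raise ValueError("tuples must have the same length")
    if all(z == 0 for z in u) or all(z == 0 for z in v):
        raise ValueError("the zero tuple does not represent a point")
    n = len(u)
    # u and v are proportional exactly when every 2x2 minor vanishes
    return all(abs(u[i] * v[j] - u[j] * v[i]) <= tol
               for i in range(n) for j in range(i + 1, n))

print(same_point((3, 2), (6, 4)))             # True, just as 2/3 = 4/6
print(same_point((1, 1j, 0), (1j, -1, 0)))    # True: the second is i times the first
print(same_point((0, 1), (1, 0)))             # False: infinity versus zero on CP^1
```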

The spaces that algebraic geometers study are the solution loci to systems
of polynomial equations. If $f(z_1, \ldots, z_n)$ is a polynomial function of
degree $d$ defined on $\mathbb{C}^n$, one defines the associated homogeneous
polynomial by $z_0^d f(z_1/z_0, \ldots, z_n/z_0)$.
(A function is homogeneous of degree $d$ if $f(\lambda z) = \lambda^d f(z)$.)
Points at infinity are added to the solutions of a system of polynomial equations
by considering all solutions to the homogeneous polynomials in $\mathbb{CP}^n$. To
see how this works, consider the example from Figure 4. The homogeneous polynomial
in question here is

$$wy^2 - x^3 + w^2 x.$$

The only zeros $[w : x : y]$ of this polynomial in $\mathbb{CP}^2$ with $w = 0$
have $x = 0$ as well, hence there is only one such zero, namely, $[0 : 0 : 1]$. Any
zero with $w \neq 0$ may be scaled to get a zero with $w = 1$. Such points
correspond to the zeros of $y^2 - x^3 + x$. The result of glueing the one unique
extra point at infinity ($[0 : 0 : 1]$) to Figure 5 is displayed in Figure 6. The
process that we just described is called projectivization. Talk to the math adept
or look in the library for more information.

Figure 6: The elliptic curve with equation $wy^2 - x^3 + w^2 x = 0$.
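Projectivization is mechanical enough to sketch in a few lines of Python. Here a polynomial in $x$ and $y$ is stored as a dictionary from exponent pairs to coefficients, and each term is padded with the power of $w$ needed to reach the top degree. This data layout and the function names are assumptions of the sketch, not notation from the paper.

```python
def homogenize(poly):
    """Map {(i, j): c}, meaning c*x^i*y^j, to {(k, i, j): c}, meaning c*w^k*x^i*y^j."""
    d = max(i + j for (i, j) in poly)  # total degree of the polynomial
    return {(d - i - j, i, j): c for (i, j), c in poly.items()}

def evaluate(hpoly, w, x, y):
    """Evaluate a homogenized polynomial at the representative (w, x, y)."""
    return sum(c * w**k * x**i * y**j for (k, i, j), c in hpoly.items())

affine = {(0, 2): 1, (3, 0): -1, (1, 0): 1}   # y^2 - x^3 + x
h = homogenize(affine)                        # w*y^2 - x^3 + w^2*x
print(evaluate(h, 0, 0, 1))                   # 0: the point [0 : 0 : 1] is on the curve
```

Evaluating at $(0, 1, 0)$ instead gives $-1$, in line with the observation that a zero with $w = 0$ must also have $x = 0$.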

We can now apply the projectivization procedure to the pencil from
Figure 2. Following the algebraic geometers, we will use the bizarre looking
notation $H^0(\mathbb{CP}^2; \mathcal{O}(L^3))$ to signify the vector space of all
homogeneous polynomials of degree 3 in three variables. Don't be intimidated by
the notation. It translates to

$$H^0(\mathbb{CP}^2; \mathcal{O}(L^3)) = \{f(x, y, z) \in \mathbb{C}[x, y, z] : f(\lambda x, \lambda y, \lambda z) = \lambda^3 f(x, y, z)\}.$$

This is a ten-dimensional complex vector space. (Can you find a basis for
it?) A linear system on $\mathbb{CP}^n$ is by definition a subspace of
$H^0(\mathbb{CP}^n; \mathcal{O}(L))$.
When one speaks of the dimension of a linear system, one usually means the
dimension of the projective space associated with the linear system. This is
one less than the vector space dimension of the linear system.
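Readers who want to check the dimension count by machine can enumerate the monomials of degree 3 in three variables, which form one natural basis. The following Python sketch lists their exponent triples. The function name is ours.

```python
from itertools import product

def monomial_exponents(degree, nvars=3):
    """Exponent tuples of all monomials of the given total degree in nvars variables."""
    return [e for e in product(range(degree + 1), repeat=nvars) if sum(e) == degree]

cubics = monomial_exponents(3)
print(len(cubics))   # 10
```

Each triple $(i, j, k)$ stands for the monomial $x^i y^j z^k$, so the list runs from $z^3$ to $x^3$.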

Definition 2 A pencil on $\mathbb{CP}^n$ is a one-dimensional linear system (hence
a two-dimensional linear subspace of $H^0(\mathbb{CP}^n; \mathcal{O}(L))$).

Wow, that was a lot of terminology and notation to describe something
fairly simple: namely, a family of polynomials of the form "parameter times
p plus second parameter times q" in which p and q are linearly independent
homogeneous polynomials of the same degree. This entire mess exists because
it helps to describe interesting features of projective algebraic varieties, and
even more general spaces. The pencil in Figure 1 is in disguise. Written as
a pencil, it takes the form $\mu(y^2 - 7yz + 6xz) + \lambda(yz - x^2) = 0$. This
morphs into expression (1) after substituting $\mu = 1$ and $z = 1$. The base
points are the common zeros of the polynomials in the pencil.

Pencils have been defined in more general settings including spaces called
projective algebraic varieties and symplectic manifolds. A projective algebraic
variety is the locus in projective space of zeros to a set of homogeneous
polynomials. To motivate the definition of a symplectic manifold, recall how
to write Newton's equation of motion in Hamiltonian form. Let $H$ denote
the total energy of a conservative mechanical system expressed as a function
of position coordinates $q_1, \ldots, q_n$ and momentum coordinates
$p_1, \ldots, p_n$. A good example is the falling ball, which has
$H = p^2/(2m) + mgq$. The equations of motion can be written in the form

$$\begin{pmatrix} \dot{q} \\ \dot{p} \end{pmatrix} = J \begin{pmatrix} \partial H/\partial q \\ \partial H/\partial p \end{pmatrix},$$

where $J$ is the matrix given by

$$J = \begin{pmatrix} 0 & 1 \\ -1 & 0 \end{pmatrix}.$$

Note that the first equation recovers the definition of the momentum and the
second equation recovers the standard $F = ma$. The bilinear form with
matrix $J$ (generalized by the analogous block matrix for higher dimensions)
is the standard symplectic form. A space obtained by identifying open subsets
of $\mathbb{R}^{2n}$ via maps that preserve the standard symplectic form is called
a symplectic manifold. Notice that many more maps preserve the symplectic
form than the maps that preserve the dot product. A map from $\mathbb{R}^2$ to
itself preserves the symplectic form if and only if it is area preserving. All
smooth projective algebraic varieties are symplectic manifolds. More information
about symplectic manifolds can be found in pQ. The vector space on a general 
symplectic manifold $X$ corresponding to the space of homogeneous polynomials
of a fixed degree that one considers when studying $\mathbb{CP}^n$ is denoted by
$H^0(X; \mathcal{O}(L))$.
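For the falling ball the Hamiltonian form of the equations of motion can be spelled out in a few lines. The Python sketch below assumes the equations read $\dot{q} = \partial H/\partial p$ and $\dot{p} = -\partial H/\partial q$, which is what applying $J$ with rows $(0, 1)$ and $(-1, 0)$ to the gradient of $H$ gives. It also checks the area-preservation criterion for a linear map of the plane. All names are ours.

```python
def hamilton_rhs(q, p, m=1.0, g=9.8):
    """Right-hand side of Hamilton's equations for H = p^2/(2m) + m*g*q."""
    dH_dq = m * g      # this H is linear in q, so the value of q does not enter
    dH_dp = p / m
    # J = [[0, 1], [-1, 0]] applied to the column vector (dH/dq, dH/dp)
    return (dH_dp, -dH_dq)

def preserves_symplectic_form(M):
    """Check M^T J M == J for a 2x2 matrix M given as nested lists."""
    (a, b), (c, d) = M
    # for 2x2 matrices M^T J M = det(M) * J, so the test reduces to det(M) == 1
    return a * d - b * c == 1

print(hamilton_rhs(0.0, 2.0, m=2.0, g=10.0))          # (1.0, -20.0)
print(preserves_symplectic_form([[1, 3], [0, 1]]))    # True
```

The first output component is the velocity $p/m$ and the second is the force $-mg$. The shear in the last line preserves area but not the dot product, which illustrates why symplectic maps are more plentiful than rigid motions.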

Returning to our example, note that the pencil in Figure 2 is nothing
more than the two-dimensional subspace of $H^0(\mathbb{CP}^2; \mathcal{O}(L^3))$
given by

$$\{\mu(zy^2 - 2x^2(x + 8z)) + \lambda(zx^2 - (y + z)^2(y + 9z)) : \mu, \lambda \in \mathbb{C}\}.$$

Each curve in the figure is the locus of zeros of one of the polynomials for
which $(\mu, \lambda) \neq (0, 0)$. Two polynomials correspond to the same curve
if and only if they are scalar multiples of each other.
It is natural to label the curves associated with a pencil by elements
$[\mu : \lambda]$ of one-dimensional complex projective space. We can now start to
describe the geometry associated with a pencil. It is worth considering pairs
consisting of a label of a curve in the pencil together with a point on that
curve. In the case of Figure 2 we obtain

$$E(1) = \{([\mu : \lambda], [x : y : z]) \in \mathbb{CP}^1 \times \mathbb{CP}^2 : \mu(zy^2 - 2x^2(x + 8z)) + \lambda(zx^2 - (y + z)^2(y + 9z)) = 0\},$$

which is a subset of $\mathbb{CP}^1 \times \mathbb{CP}^2$ called the rational
elliptic surface. There is a projection map $\pi_2 : E(1) \to \mathbb{CP}^2$ given
by $\pi_2([\mu : \lambda], [x : y : z]) = [x : y : z]$.
With the exception of the nine points where $zy^2 - 2x^2(x + 8z) = 0$ and
$zx^2 - (y + z)^2(y + 9z) = 0$ simultaneously, every point in $\mathbb{CP}^2$ has
exactly one preimage in $E(1)$. The inverse image of any of the nine intersection
points is a copy of the Riemann sphere $\mathbb{CP}^1$. We say that $E(1)$ is the
result of "blowing up" $\mathbb{CP}^2$ at nine points, while $\mathbb{CP}^2$ is the
result of "blowing down" nine spheres in $E(1)$. Algebraic geometers say that
$E(1)$ and $\mathbb{CP}^2$ are birationally equivalent because they are related by
a finite number of blow-ups.

To get deeper into the geometry, consider the other projection map
$\pi_1 : E(1) \to \mathbb{CP}^1$, which is given by
$\pi_1([\mu : \lambda], [x : y : z]) = [\mu : \lambda]$. The
critical points of $\pi_1$ are the points where its derivative is equal to zero.
(In general critical points are the points where the derivative of a mapping is
not surjective.) To interpret this in the present context, recall that
$[x : y : z]$ and $[\mu : \lambda]$ are equivalence classes. Provided that
$z \neq 0$ and $\mu \neq 0$ we may represent these points in the form
$[x : y : 1]$ and $[1 : \lambda]$. Substituting $z = 1$
and $\mu = 1$ into the equations that define $E(1)$ yields

$$\lambda = \frac{y^2 - 2x^2(x + 8)}{(y + 1)^2(y + 9) - x^2}.$$

The critical points with $z \neq 0$ and $\mu \neq 0$ can now be found by setting
the partial derivatives of $\lambda$ with respect to $x$ and $y$ equal to zero.
Critical points with $z = 0$ or $\mu = 0$ can be found similarly. The critical
points of $\pi_1$ with their corresponding $\lambda$ and $\mu$ values are listed in
Table 1 (first two significant digits only).
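Two of the critical points can be checked by hand and by machine. On the line $x = 0$ the formula for $\lambda$ reduces to $y^2/((y + 1)^2(y + 9))$, and setting its $y$-derivative to zero away from $y = 0$ gives $y^2 - y - 18 = 0$. That reduction is our own computation, not a step stated in the paper. The Python sketch below confirms numerically that both partial derivatives vanish at the two roots, using central differences.

```python
import math

def lam(x, y):
    """The parameter lambda of the pencil member through (x, y), with z = mu = 1."""
    return (y**2 - 2 * x**2 * (x + 8)) / ((y + 1)**2 * (y + 9) - x**2)

def grad(f, x, y, h=1e-6):
    """Central-difference approximation to the gradient of f at (x, y)."""
    return ((f(x + h, y) - f(x - h, y)) / (2 * h),
            (f(x, y + h) - f(x, y - h)) / (2 * h))

# Roots of y^2 - y - 18 = 0 (hand computation; an assumption of this sketch).
candidates = [(1 + math.sqrt(73)) / 2, (1 - math.sqrt(73)) / 2]
for y0 in candidates:
    gx, gy = grad(lam, 0.0, y0)
    print(round(y0, 1), round(lam(0.0, y0), 2), abs(gx) < 1e-6, abs(gy) < 1e-6)
```

The rounded values of $y$ and $\lambda$ are 4.8 and 0.05 for the first root and -3.8 and 0.35 for the second, which agree with the corresponding rows of Table 1.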

Table 1: Critical points.

x                 y                   z    λ                   μ
0                 0                   1    0                   1
0                 4.8                 1    0.05                1
0                 -3.8                1    0.35                1
-8.2              -6.2                1    7.8                 1
1.0               -1.0                1    17                  1
4.1 + 11i         -6.3 - 0.016i       1    20 + 11i            1
4.1 - 11i         -6.3 + 0.016i       1    20 - 11i            1
-0.5 - 0.87i      -1 - 0.00046i       1    15 - 0.87i          1
-0.5 + 0.87i      -1 + 0.00046i       1    15 + 0.87i          1
-16 + 0.011i      3.6 - 110i          1    0.0014 + 0.011i     1
-16 - 0.011i      3.6 + 110i          1    0.0014 - 0.011i     1
0                 -1                  1    1                   0

Away from the critical values (a critical value is the image of a critical
point; any other point in the codomain is a regular value) the inverse image
$\pi_1^{-1}([\mu : \lambda])$ is a smooth cubic similar to the curve in Figure 6.
(The inverse image of a regular value is called a generic curve.) The inverse
image of a
critical value is a fish tail. (The inverse image of a critical value is called a
singular curve.) For example $\pi_1^{-1}([1 : 0])$ is the fish tail depicted in
the right frame of Figure 3. The corresponding critical point has $x = 0$,
$y = 0$, and $z = 1$ (the coordinates of the point where the tail meets the fish).
The inverse image of a circle that does not enclose a critical value is just a
"cylinder," as in the left frame of Figure 7, where each fiber (vertical line) is
really the donut shape of a smooth cubic (Figure 6). The inverse image of a circle
that surrounds a critical value is a fiber bundle (i.e., a "twisted cylinder" as
in the right frame of Figure 7). We say that the cylinder or Möbius band
fibers over the circle. A space that fibers over a surface with only
nondegenerate critical points and only one critical point per critical value is
called a Lefschetz fibration. (A critical point is called nondegenerate if there
are coordinate systems around the point and the image of the point for which
the projection map takes the form
$\pi(z_1, \ldots, z_n) = z_1^2 + \cdots + z_n^2$.) A space that
can be turned into a Lefschetz fibration by blowing up a number of singularities
is called a Lefschetz pencil. Lefschetz fibrations may be described up to
topological equivalence by combinatorial data that describes the monodromy
(i.e., "twisting" around each singular fiber). It turns out that any projective
algebraic variety has the structure of (i.e., is the domain of) a Lefschetz
pencil obtained by intersecting the variety with every complex codimension one
hyperplane that contains a given generic (one must avoid a small set of bad
ones) complex codimension two hyperplane. The recent activity related to
Lefschetz pencils stems in part from a pair of theorems, one by Simon Donaldson
and the other by Bob Gompf. The theorem of Donaldson states that
any symplectic manifold (an important class of objects in modern geometry)
admits the structure of a Lefschetz pencil [2]. Gompf's theorem establishes
that any Lefschetz fibration is a symplectic manifold [1]. You guessed it.
Talk to the mathematically adept or read all about it at your library. We now
move from pencil propaganda to the solution of the general quartic.

Figure 7: Trivial and nontrivial fibrations.



3 THE QUARTIC FORMULA.

By scaling, any quartic equation can be written in the form

$$x^4 + ax^3 + bx^2 + cx + d = 0.$$

We substitute $x = u + k$ into this equation, combine like terms, and solve for
the value of $k$ that forces the coefficient of $u^3$ to zero. This gives
$k = -a/4$, whence $u^4 + pu^2 + qu + r = 0$ with

$$p = b - \frac{3a^2}{8}, \qquad q = c - \frac{ab}{2} + \frac{a^3}{8}, \qquad r = d - \frac{ac}{4} + \frac{a^2 b}{16} - \frac{3a^4}{256}.$$
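The substitution $x = u - a/4$ is easy to get wrong by hand, so here is a small Python sketch of the coefficient formulas $p = b - 3a^2/8$, $q = c - ab/2 + a^3/8$, and $r = d - ac/4 + a^2 b/16 - 3a^4/256$ that result from expanding it. The function name is ours. The test quartic $(x - 1)(x - 2)(x - 3)(x - 6)$ is a made-up example chosen so that the shifted roots are $-2$, $-1$, $0$, and $3$.

```python
def depress_quartic(a, b, c, d):
    """Return (p, q, r) such that x = u - a/4 turns
    x^4 + a x^3 + b x^2 + c x + d into u^4 + p u^2 + q u + r."""
    p = b - 3 * a**2 / 8
    q = c - a * b / 2 + a**3 / 8
    r = d - a * c / 4 + a**2 * b / 16 - 3 * a**4 / 256
    return p, q, r

# (x - 1)(x - 2)(x - 3)(x - 6) = x^4 - 12x^3 + 47x^2 - 72x + 36
print(depress_quartic(-12, 47, -72, 36))   # (-7.0, -6.0, 0.0)
```

The output corresponds to $u^4 - 7u^2 - 6u = u(u + 2)(u + 1)(u - 3)$, as expected.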




Now set $v = u^2$. Solving the system of equations

$$v = u^2, \qquad v^2 + pv + qu + r = 0$$

is easily seen to be equivalent to solving the quartic $u^4 + pu^2 + qu + r = 0$.
The quartic corresponding to the pencil in Figure 1 is $x^4 - 7x^2 + 6x = 0$. In
fact, for any fixed parameter $\lambda$ the quartic $u^4 + pu^2 + qu + r = 0$ is
equivalent to the system

$$v = u^2, \qquad v^2 + (p + \lambda)v - \lambda u^2 + qu + r = 0. \qquad (3)$$

The pencil associated with Figure 1 has the form of the second equation in
(3). Geometrically, the second equation represents a pencil of conic sections
that share four base points. These four points provide the desired solutions
to the quartic. In general, finding the points of intersection of a pair of conic
sections (say, a parabola and an ellipse) is difficult. However, the pencil of
the second equation in (3) has a singular fiber. One (corresponding to
$\lambda = 1$ in the specific pencil (1)) is displayed in Figure 1. This singular
fiber may be described as the union of two lines. Finding the points of
intersection of a line and parabola is routine. There are actually two more
singular fibers in this pencil. (Can you find them?)

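We can rewrite the second equation in (3) as

$$\left(v + \tfrac{1}{2}(p + \lambda)\right)^2 - \lambda\left(u - \frac{q}{2\lambda}\right)^2 = \tfrac{1}{4}(p + \lambda)^2 - \frac{q^2}{4\lambda} - r.$$

Provided that

$$\tfrac{1}{4}(p + \lambda)^2 - \frac{q^2}{4\lambda} - r = 0$$

(i.e., $\lambda^3 + 2p\lambda^2 + (p^2 - 4r)\lambda - q^2 = 0$), we can conclude that

$$\left(v + \tfrac{1}{2}(p + \lambda)\right) \pm \sqrt{\lambda}\left(u - \frac{q}{2\lambda}\right) = 0.$$

Substituting $v = u^2$ into these two equations produces two quadratic equations
that we can solve for $u$. The quartic formula then reads as follows: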



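Fact 3 If $x^4 + ax^3 + bx^2 + cx + d = 0$, then

$$x = \frac{1}{2}\left(-\sqrt{\lambda} \pm \sqrt{\lambda - 2\left(p + \lambda - \frac{q}{\sqrt{\lambda}}\right)}\right) - \frac{a}{4}$$

or

$$x = \frac{1}{2}\left(\sqrt{\lambda} \pm \sqrt{\lambda - 2\left(p + \lambda + \frac{q}{\sqrt{\lambda}}\right)}\right) - \frac{a}{4},$$

where

$$p = b - \frac{3a^2}{8}, \qquad q = c - \frac{ab}{2} + \frac{a^3}{8}, \qquad r = d - \frac{ac}{4} + \frac{a^2 b}{16} - \frac{3a^4}{256},$$

and $\lambda$ is an arbitrary nonzero solution to

$$\lambda^3 + 2p\lambda^2 + (p^2 - 4r)\lambda - q^2 = 0. \qquad (4)$$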
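The following Python sketch turns Fact 3 into a function. It takes a nonzero root $\lambda$ of the cubic (4) as input rather than computing one, and solves the two quadratics obtained from $u^2 + \tfrac{1}{2}(p + \lambda) \pm \sqrt{\lambda}\,(u - q/(2\lambda)) = 0$. The function name, the error handling, and the tolerance are our own choices, and this is a sketch rather than a numerically robust solver.

```python
import cmath

def quartic_roots(a, b, c, d, lam):
    """Roots of x^4 + a x^3 + b x^2 + c x + d, given a nonzero root lam of cubic (4)."""
    p = b - 3 * a**2 / 8
    q = c - a * b / 2 + a**3 / 8
    r = d - a * c / 4 + a**2 * b / 16 - 3 * a**4 / 256
    residual = lam**3 + 2 * p * lam**2 + (p**2 - 4 * r) * lam - q**2
    if lam == 0 or abs(residual) > 1e-9:
        raise ValueError("lam must be a nonzero root of the cubic (4)")
    s = cmath.sqrt(lam)
    roots = []
    for sign in (1, -1):
        # quadratic: u^2 + sign*s*u + (p + lam)/2 - sign*q/(2*s) = 0
        disc = cmath.sqrt(lam - 2 * (p + lam - sign * q / s))
        roots.append((-sign * s + disc) / 2 - a / 4)
        roots.append((-sign * s - disc) / 2 - a / 4)
    return roots

# The quartic of Figure 1: x^4 - 7x^2 + 6x = 0, with lam = 1.
print(sorted(z.real for z in quartic_roots(0, -7, 6, 0, 1)))   # [-3.0, 0.0, 1.0, 2.0]
```

For this quartic the values $\lambda = 4$ and $\lambda = 9$ also satisfy (4) and return the same four roots, grouped into different pairs.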



The solution to the quartic equation is only useful if one has a method
for solving equation (4). One solution to the general cubic equation is given
in the following fact:

Fact 4 If $a$, $b$, and $c$ are arbitrary complex numbers and if

$$x = \sqrt[3]{-q/2 + \sqrt{q^2/4 + p^3/27}} + \sqrt[3]{-q/2 - \sqrt{q^2/4 + p^3/27}} - a/3, \qquad (5)$$

where $p = (3b - a^2)/3$ and $q = (2a^3 - 9ab + 27c)/27$, and where the two cube
roots are chosen so that their product is $-p/3$, then
$x^3 + ax^2 + bx + c = 0$.

The other solutions to the general cubic are expressible as linear combinations
of the two cube roots in equation (5). One could try to derive the solution by
constructing an appropriate pencil. Alternatively, the general cubic can be
solved by making a change of variables and scale to put it in the "reduced"
form $4u^3 - 3u - k = 0$, which can then be solved by comparison with the
identity $4\cos^3\theta - 3\cos\theta - \cos(3\theta) = 0$. Euler's formula for
$e^{i\theta}$ is used to interpret $\cos(\frac{1}{3}\arccos k)$ for values of $k$
that do not lie between $\pm 1$.

Applying the foregoing method to the quartic associated with Figure 1,
one rewrites the pencil in the form

$$\left(y + \tfrac{1}{2}(\lambda - 7)\right)^2 - \lambda\left(x - \frac{3}{\lambda}\right)^2 = \tfrac{1}{4}(\lambda - 7)^2 - \frac{9}{\lambda}.$$

Setting the right-hand side equal to zero and simplifying gives

$$\lambda^3 - 14\lambda^2 + 49\lambda - 36 = 0.$$

One solution is $\lambda = 1$. This leads to $(y - 3) \pm (x - 3) = 0$, which can
be combined with $y = x^2$ and solved to give the roots 0, 1, 2, and -3 of
$x^4 - 7x^2 + 6x = 0$.

There is a long history associated to the solution of the quartic equation.
The historical notes (written by Victor Katz) in the abstract algebra text
on my shelf run from the Babylonians to the Chinese to the Persian Omar
Khayyam, through several Europeans (Cardano, Tartaglia, Ferrari) and just
keep going [3].
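Returning for a moment to Fact 4, formula (5) can be tried out on the cubic (4) of the worked example. The Python sketch below computes one cube root $u$ and then takes the second to be $-p/(3u)$, so that the product of the two cube roots is $-p/3$. That pairing is a standard precaution we add here, and the function name is ours.

```python
import cmath

def cardano_root(a, b, c):
    """One root of x^3 + a x^2 + b x + c = 0 following formula (5)."""
    p = (3 * b - a**2) / 3
    q = (2 * a**3 - 9 * a * b + 27 * c) / 27
    inner = cmath.sqrt(q**2 / 4 + p**3 / 27)
    # use the larger of the two radicands to avoid dividing by a tiny number
    big = max(-q / 2 + inner, -q / 2 - inner, key=abs)
    if big == 0:
        return complex(-a / 3)      # p = q = 0: a triple root
    u = big ** (1 / 3)              # principal complex cube root
    v = -p / (3 * u)                # paired cube root, chosen so that u*v = -p/3
    return u + v - a / 3

# Cubic (4) for the quartic of Figure 1, which has p = -7, q = 6, r = 0.
p4, q4, r4 = -7, 6, 0
lam = cardano_root(2 * p4, p4**2 - 4 * r4, -q4**2)
print(abs(lam**3 + 2 * p4 * lam**2 + (p4**2 - 4 * r4) * lam - q4**2) < 1e-9)   # True
```

Any root found this way can be used as the parameter $\lambda$ in Fact 3, provided that it is nonzero.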

One of the truly amazing things about mathematics is that ideas that 
at first seem unrelated are often related in surprising ways. Heading out to 
solve the quartic equation, it is possible to get lost on a detour studying the 
geometry of pencils. There are many other forks in the road that we passed. 
One can ask, for instance, if there is a similar formula for the solution to the 
quintic equation. The answer is that there is no formula for the solution to 
the general quintic equation that can be expressed in terms of a finite number 
of radicals and field operations. There is, however, a solution of the general 
quintic expressed in terms of hypergeometric functions. This fork leads deep 
into algebraic territory and the realm of differential equations. The fork 
down the symplectic geometry path takes one deep into the mathematics 
of mechanical systems. Hopefully this little outing will encourage readers 
to spend some time getting lost in and picking up some souvenirs from the 
world of mathematics. 